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Abstract. Modeling the evolution of a financial index as a stochastic process is a problem awaiting 
a full, satisfactory solution since it was first formulated by Bachelier in 1900. Here it is shown that 
the scaling with time of the return probability density function sampled from the historical series 
suggests a successful model. The resulting stochastic process is a heteroskedastic, non-Markovian 
martingale, which can be used to simulate index evolution on the basis of an auto-regressive strategy. 
Results are fully consistent with volatility clustering and with the multi-scaling properties of the 
return distribution. The idea of basing the process construction on scaling, and the construction itself, 
are closely inspired by the probabilistic renormalization group approach of statistical mechanics and 
by a recent formulation of the central limit theorem for sums of strongly correlated random variables. 
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1. Introduction 

Economics and mathematical finance are multi-disciplinary fields in which the tendency of 
statistical physicists to focus on universal laws has been criticized sometimes [1]. In par- 
ticular, the emphasis on scaling properties typical of many recent contributions in econo- 
physics has been regarded with skepticism by some economists, in view of the apparent 
scarcity of useful practical consequences of this symmetry [2]. 

As statistical physicists aware of the key role played by scaling and universality in the 
development of the theory of complex systems in the last decades, we do not share this 
point of view. Universal laws are necessary for building up our scientific understanding 
and we do not intend to give them up. In the specific case of scaling symmetries, it is 
perhaps fair to admit that, so far, their potential consequences in finance have not been fully 
explored and elucidated [3]. In the present note we report on recent work [4] demonstrating 
that scaling, combined with symmetries enforced by the efficiency of the market, allows 
substantial progress towards the solution of the central problem of mathematical finance: 
assuming that the time evolution of a financial index, or asset price, amounts to a stochastic 
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process, formulate a satisfactory model of this process, consistent with as many as possible 
stylized facts established by the statistical analysis of the historical series [5-9]. This 
problem awaits a full, satisfactory solution since it was first formulated by Bachelier [10]. 

The approach sketched below points out far reaching consequences of the scaling in 
time obeyed by the return probability density function sampled from the historical series 
of an index. These consequences add further strong constraints to those already implied 
by market efficiency, and suggest very plausible probabilistic rules for the process of index 
evolution. Our goal here is reached through ideas which are partly inspired by the proba- 
bilistic formulation of the renormalization group (RG) in statistical mechanics [11,12], and 
by a recent extension of the central hmit theorem to sums of strongly correlated variables 
obeying anomalous scaling [13]. To our knowledge, renormalization group ideas do not 
seem to have been applied in mathematical finance so far. 

This report is organized as follows. In the second section we recall the basic facts emerg- 
ing from the statistical analysis of the historical series, taking as example the Dow Jones 
Industrial (DJI) index. In parallel we also present the core of our derivations and stress 
their links with renormalization group ideas. In the third section we briefly describe our 
stochastic model for index evolution, while in the subsequent, fourth one we review the 
results of the simulation of the DJI index. The last, fifth section is devoted to concluding 
remarks. 



2. Stylized facts and consequences of scaling 

Let us indicate by S{t) the value of an index at time t. For our purposes here, we can 
assume that t is measured in days and S{t) represents the daily closure value. A quantity of 
interest [5-7] is the logarithmic return rf .T = ln(S{t+T)—\nS{t)) in the interval [t,t+T]. 
The values of In S used to compute rt,T are assumed to be detrended. In S'(i) i-^ In S{t) — 
p t, where p is the average linear growth over the whole time series. A probability density 
function (PDF) for this return can be sampled from sufficiently long historical time series. 
The resulting PDF, prir), does depend only on T. Indeed, being sampled with a sliding 
interval method, px conveys only a stationarized information on return occurrences. For 
T's in the range from one day to few months, pt satisfies approximately simple- scaling: 
Prir) = jSt), where the exponent D turns out be very close to 1 /2 for the indexes 
of well developed markets [14]. The scaling function g, however, is not Gaussian, and 
shows power law, Pareto tails at large |r| [5]. The scaUng of px in the case of the DJI 
index is illustrated by the collapse plot in Fig. 1. The non Gaussian form of g indicates 
that successive returns in the sampling must have strong correlations on the time range 
where scaling holds. Before mentioning other stylized facts, it is worth concentrating on 
the scaling of pr, which plays a central role in our approach. 

In the modern theory of critical phenomena [12], one can consider a finite block of N 
interacting spins and try to identify the critical conditions under which doubling the block 
size (N — !■ 2N) signals the presence of scale invariance in the system. In a phenomeno- 
logical [15] version of the probabilistic RG approach [11], this system doubling can be 
more simply implemented for hierarchical models in which the Hamiltonian depends only 
on the total sum, M, of the spins. Here the spins, and thus M, are assumed to take values 
in R. The PDF of M in a system of N spins is then indicated by qN{M) and the RG trans- 
formation yields q2N{M) as a functional of qN{M), once a suitable interaction between 
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Figure 1. Scaling collapse of the histograms of return PDF's of the DJI. Data are 
sampled over about 26000 daily closures from 1900 to 2005. The dotted curve is a 
Gaussian, while the continuous one is the best fit [18] used here to implement our 
stochastic model. 



blocks is assumed: 

/ dMidM2 e-f^^(A'^i'A'^2) QNiMi) qN{M2) 5{M - Mi - M2) 



q2N{M) = 



/ dMidM2 e^HMi,M2) g^(Mi) qN{M2) 

(1) 



In Eq. (1), Hi{AIi, AI2) is the reduced (divided by —ksT) coupling between the mag- 
netizations of the two blocks. The factor multiplying the delta function in the integrand 
is just the Boltzmann-Gibbs expression of the joint PDF for the magnetizations Mi and 
M2 of the two blocks. Fixed-point critical scaling prevails when the interactions are 
chosen in such a way that Eq. (1) is satisfied by a qjv assuming a simple-scaling form 
Q2n{M) — -^qN{^), where D is now related to the critical exponents of the model. 

We can envisage a sort of reverse RG strategy in finance. In analogy with the magnetic 
case, the scaling of prir) can be regarded as a fixed-point scaling for a 'block' of T daily 
returns. So, we can ask what kind of 'coupling' must exist between the returns of two 
successive blocks of duration T, in such a way that, as we know, P2t(?') satisfies simple- 
scaling, i.e. p2T{r) — -^Pri^)- Since there is no Hamiltonian now, the statistical 
information on this coupling is embodied in the unknown joint PDF of the returns ri and 

r2 in the successive intervals, P2t(^1i ^2)- Indeed, since the returns ri and r2 sum up to 
the return in the interval of duration 2T, this joint PDF satisfies: 

/ (2) 

P2T{r) = dridr2 P2Tiri,r2) S{r - ri - r2). (2) 
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One realizes that ri and r2 in Eq. (2) play a role analogous to that of Mi and M2 in Eq. (1). 

(2) 

Since known, one can imagine to determine this function in terms of pr on the 

basis of Eq. (2). In the magnetic RG analogy this would amount to determine the critical 
interaction conditions, once the fixed-point scaling form of g^r (M) is given. Of course, this 

(2) 

determination is not expected to be unique in general: there can be many different P2t's 
satisfying Eq. (2) for a given px- However, there are other constraints and symmetries 
helping in the search of the right solution. A well established empirical fact is that the 
average (rir2) (2) = [ dridr2 pItC^'I' ''2) rir2 must be equal to zero. This can be easily 

Pit 

verified and is in fact an obvious requisite for an efficient market. A deviation from zero of 
this average would open an arbitrage opportunity which would be immediately exploited 
and suppressed by the market. Other constraints concern the marginal PDF's: 



The validity of Eqs. (3-4) is based on the fact that both pT and sampled with 

a sliding interval method from the historical time series. This marks a difference with 
respect to the magnetic case, because there the marginal PDF's would be computed at a 
rescaled N, due to a renormaUzation effect which is excluded here for the empirical PDF's 
of finance. On the basis of Eqs. (2-4) and of the linear decorrelation of successive returns, 
it is immediate to derive that, for the scaling of px to occur, one must necessarily have 
D = 1/2. It is sufficient to express the average ((ri -|- r^)^) (2) , and to take into account 

that the scaling form of px implies that its second moment must scale as {r'^)pj. ~ T^^. 
One then finds immediately that 2T^" = {2Tf'^, i.e. £1=1/2. This result explains the 
robustness of the estimate £) ~ 1/2 emerging from the statistical analysis of all indexes in 
mature markets [14]. 

Now let us come back to the problem of expressing p^x in terms of px- If the Un- 
ear decorrelation of successive returns, i.e. (rir2) (2) = 0, would imply a complete 

Pit 

decorrelation of ri and r2, the problem would be easily solved. Indeed, independence 
would mean p^x = Pt(?'i)pt(?'2)- By substituting in Eq. (2), and using the scal- 
ing form of Px, we would then conclude immediately that D = 1/2 and a Gaussian 
g{x) = exp(— a;^/2(j^)/-\/27rcr^ are necessary for consistency. Indeed, in this case of 
independence Eq. (2) just imposes to g the property of stability which is at the basis of 
the central limit theorem and is satisfied, for finite variance a'^, by the Gaussian PDF alone 
[16]. This is even more directly verified in terms of characteristic functions (CF). For px 
the CF is pxik) = J dre^'^^ px{r) = g(T^k), and Eq.(2), together with the scaling and 
the independence conditions, simply reads 



which has^ = exp(— cr^fc^/2) as solution forD = 1/2. We know, however, that the linear 
decorrelation of returns is not implying independence: for example, the so called effect of 
volatility clustering leads to {rfr'^) ^ {fDifVl for '^^^ '^'^o successive returns. Likewise, 
a well estabUshed fact is that the absolute value of daily returns shows a strong positive 




(3) 



(4) 



.9((2T)^fc) = ~g{T^k) -giT'^k), 



(5) 



4 



Role of scaling in the statistical modeling of finance 



autocorrelation function also at a distance of months [5-7]. This autocorrelation function 
decays as a power of the time interval r separating the two days (see Fig. 2a). 

(1^ (2'] 

Indicating by P2T(fci 5^2) the CF of the joint PDF p2T,Eqs. (2-4) above read: 

(6) 
(7) 
(8) 

where D = 1/2 has been already substituted. It is then immediate to realize that a possible 
solution is simply: 



P2T (^1 ^) 




/2Tfc), 






/Tk), 




= ~9i^ 


/Tk), 



i,k2) =9{\[^ 



pf^ {k^,k2) = ~g{jTkf+ Tkl ] , (9) 



provided such a ^ characteristic function, i.e. its inverse-transform is a PDF in 

(ri, r2). This is not of course the case for any g, but one can show that there is a large 
class of CF's satisfying this requisite, as can be checked by numerics [4], or established 
on the basis of rigorous theorems [17]. The solution in Eq. (9) is of course not the unique 
possibility. However, it is strongly suggested, in first place, by the good consistency it 
demonstrates with the statistical data. When sampled over the whole history of the DJI 
index from 1900 to 2005, the histograms of the empirical conditional PDF's of a return 
r2 once a previous return of modulus |ri | has been realized, are very well reproduced by 
the analytical prediction based on Eq. (9) (Fig. 3). The analytical expression forp2j., and 
thus of P2t(''2||'''i|), is here obtained on the basis of a particular form of g [18], whose 
parameters have been fixed by a preliminary fit of g as illustrated in Fig. 1. In Fig. 3 we 
show the conditional PDF for a given |ri | at different values of T. 

A further reason in favor of our solution is the fact that the recipe in Eq. (9) which 

(2) 

constructs P2T through replacement of single argument dependence of 5 by a spherically 
symmetric dependence in (^1,^2), can be regarded as a rule of algebraic multipUcation 
of g{ki) by 3(^2). This type of multiphcation, which is straightforwardly generalized to 
more than two factors, is commutative and associative, has as neutral element g{0) — 1, 
and, most important, can be put at the basis of an extension of the central limit theorem for 
sums of strongly correlated variables [13]. In this perspective Eq. (9) can be put in a form 

pfkkiM) = g (Vrfci) ® g (Vrfe) , (lO) 

where the symbol (g) represents the non-standard multiplication [13]. The (g) multiplica- 
tion reduces to the standard one for Gaussian g's [4,13], and in this case Eq. (10) gives 

P2T (^1 ) ^2) = g (^Vrkij g (VTk2^ , consistently with Eq. (5). 



3. Non-stationary stochastic model of index evolution 

We already mentioned that the above solution in Eq. (9) can be generalized to the case of 
more than two successive intervals. Indeed, through our recipes we can construct a joint 
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(a) Log-log plot of the empirical volatility autocorrelation at time separa- 

T.l^S" l'-(*>i)ll'-(*+r.i)|-Et^o°" k(t.i)l£;s.'o" k(t+r,l)|/t„.„^ 



where tmax + r — 1 is the total length of the time series. The data refer to the DJI index 
and tmax + r — 1 is about 26000 days. The continuous line has slope ~ —0.2, the 
exponent of the power-law decay. In (b) the empirical data have been replaced by data 
from the simulation of one history. 
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Figure 3. Symbols are the empirical conditional PDF, p2T(r2||ri|), of a return r2 
following ri with |ri | = 0.02. The continuous curves are obtained on the basis of Eq. 
(9), when g results from the fit in Fig. 1 



PDF for many returns. If, for example, we consider the CF corresponding to the joint PDF 

(n) 

of n successive daily returns, p^i , this will be simply given by 



The joint PDF's for various n define a non-Markovian, self-similar stochastic process 
thanks to the algebraic properties of the (g) multiplication. It would be tempting to re- 
gard this process as the one which directly generated the history of the index to which g 
pertains. Indeed, a nice property one can deduce for it is that for such a process returns 
would be stationary. So, the ensemble PDF for returns in the process could be directly 
sampled by the sliding interval procedure yielding the empirical pr- Another nice prop- 
erty is that the process would be a martingale [19], i.e. the conditional expectation of the 
future return is always zero, independent of the conditioning history. This is embodied in 
the construction of the which are even in the their dependence on each of the ki's. 
However, there are clear indications that this simple scenario would be oversimplified, and 

(n) 

that PDF's like p]^-^' cannot directly describe the postulated stochastic process potentially 
able to generate a whole ensemble of alternative histories. 

The reasons why such a process with stationary increments would not be acceptable are 
two. In first place, for such a process the autocorrelation function of the absolute value 
of daily returns would not decrease with time, but rather be a constant, in disagreement 
with the empirical observation of a power law decay ( Fig. 2a) [5]. Furthermore, one has 



kn) = 9{ki) <^ g{k2) g{kn) 



(11) 




(12) 
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(n) 

to consider that the construction of assumes a strict simple-scaling form for px- We 
know that this simple-scaling is well obeyed by this PDF only if we restrict ourselves to 
consider its lowest moments, i.e. (|r'|')py with < 3, like we did in our argument leading 
to D = 1/2. In fact multiscaling-like effects are observed in the scaling of higher moments 
[4,14]. These effects mean that the q-th moment of scales as T^^'^\ with D{q) < q/2 
for > 3, and would not be taken into account by the stochastic model. 

A way out of the above difficulties is found if one considers that the very assumption 
of stationary increments for the process underlying index evolution is, a priori, not justi- 
fied. Stationarity is often assumed on the basis of the fact that pT is stationary by con- 
struction. However, there is no compelUng reason to do this and to identify pt with the 
PDF of the returns of the underlying process. The recent literature even reports indica- 
tions that the stochastic processes driving exchange rates could be characterized by time- 
inhomogeneities in the returns [20] . Within our scheme it is easy to embody the possibility 
of non- stationary returns in the ensemble generating process. The key is found going back 
to our arguments leading to the conclusion that D = 1/2 for the empirical pT- For the 
postulated non-stationary process driving the index, the ensemble PDF for returns like rt^r 
should be a function of both t and T, which we indicate here by Pt,T{r). Likewise, the 

joint PDF of the process corresponding to p^rp can be indicated by P^^Ti^^i ^'2)- Let us 
now consider the equations applying to these ensemble PDF's and corresponding to Eqs. 
(3) and (4) for the empirical PDF's. With t = for simplicity, one gets: 

Po,T(ri)= J dr2P^%{ri,r2), (13) 

PT,T{r2) = J dnP^%{rur2). (14) 

Eq. (2) is simply rewritten as 

fo,2T(r) = j dridr2 Po%(r-i, r2) S{r - n - r2). (15) 

Suppose further that a simple-scaling form is valid for Po,t, i- e. 

PO'T = (^) , (16) 

with an ensemble scaling function and an ensemble dimension Dg. Consider now the 
ensemble average ((ri -I- r^)'^) p(2) in the light of Eqs. (13-16) and of (rir2) p(2) = 0. 

If we want to recast the r.h.s. of Eq (14) in the form of Pq.ts we realize that we must put 
T' = aT, with a = (2^^« — l)^/^^«, in order to be consistent with the scaling assumed for 
Po,T- If De 7^ 1/2, there is an inhomogeneity in the process, measured by this rescaling 
a, which reveals an asymmetry between the first and the second T-interval considered. In 
other words, a 7^ 1 signals a preferential direction of time: the evolution of the index in the 
second interval occurs with width rescaled with respect to that of the previous one. This 
effect is consistent with causality and the rescaling T' = aT is analogous to the rescaling 
of the block size N which one would have when constructing marginal PDF's for Mi or 
M2 in the magnetic RG. Of course, in that case, there would not be causality, and the 
rescahng would apply symmetrically to both PDF's as a consequence of the correlations. 
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This argument already shows that our formalism leaves room for the construction of 
stochastic processes which are more general than that the one introduced in Eqs. (1 1-12). 

(2) 

One has to follow steps similar to those which led us to construct the solution for in 
terms of g, taking into account the presence of a 7^ 1. For the CF of Pq 2t^ example, 
we get now P(j^2T(^i' ^2) = 9e {T'^'h) O Qe ((aT)^«fc2). Similarly, for a sequence of n 



where Ui = [i"^^" — {i— 1)^^=] ^ i = 1,2, .. .n. These last CP's again fully char- 
acterize a stochastic process, which is now non-stationary. The process is consistent with 
the simple-scaling of Pq.t, but now Pt^r satisfies a more general, inhomogeneous form of 
scaUng: 



On the basis of this last equation, one can try to analyze how the effective scaling dimension 
for T of the order of the month varies as a function of t. This is illustrated in Fig. 4 for a 
value of Dp = 0.24, which, as discussed below, is directly relevant for the application to 
the DJI index. One sees that, after a rather fast increase at short t, this effective dimension 
Df.ff approaches the value 1/2, which is the asymptotic limit. Thus, if one would sample 
with sliding interval method a return PDF pT along a sufficiently long single history of the 
process consistent with Eq. (18), this PDF would show a scaling with D ~ 1/2 for T of 
the order of the month. At the same time, the initial deviation from 1/2 of the effective 
dimension shown in Fig. 4 suggests that this px could manifest multiscaling-like features. 

The problem arises now of postulating some concrete mechanism through which inho- 
mogeneity can act in generating an index history. It is natural to assume that the inhomo- 
geneity crosses over to homogeneity when t exceeds some cut-off tc- This cut-off time 
could be of the order of the autocorrelation time of volatility, i.e. several hundreds of days. 
Of course, tc should be regarded as a statistical average of the duration of many random in- 

(n) 

tervals within which the process is described by a Pq „\ corresponding to Eq. (17). At the 
junctions between these intervals, which can be imagined to coincide with relevant exter- 
nal events influencing the market, one could assume that the progression of the coefficients 
Ui is suddenly interrupted and restarted, either from the beginning (aj+i = ai), or from a 
randomly chosen stage (oj+i = Uk, with k ^ 1). 

4. Simulation of the model and results for the DJI index 

(n) 

The knowledge of P^^ allows to implement an autoregressive strategy for the simulation 
of the process. Autoregressive methods are used extensively in finance, e.g. for the imple- 

(n) 

mentation of ARCH or GARCH processes [21,22]. Suppose we consider P„i , with, e.g., 
n = 100. If we give as input the first 99 returns, the joint PDF can be used in order to 



daily returns, we can write: 




(17) 




(18) 
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Figure 4. Effective dimension -De// for 1 < T < 40 deduced from Eq. (18). This 
Deff is obtained from the behavior of the moments of Pt.r as a function of T in this 
range. 

define the conditional PDF of the return in the hundredth day. Once this return is extracted 
consistently, one can use the returns from the second to the hundredth day included in order 
to extract in a similar way the return of the 101-th day, and so on. Without entering into 
the details of how this is practically implemented [4], here we just review the results one 
can obtain. 

(n) 

By expressing P„j^ in terms of g^, whose expansion around k — is directly linked to 
that of the empirical g [4], we generate single histories supposed to imitate the one at our 
disposal with over one century of DJl index daily closures. In all cases we fix to — 500 
on average for the inhomogeneity updating and n = 100 for the auto-regressive scheme. 
Once a single history is generated, we act on it, by sliding interval sampling techniques, 
exactly in the same way one does on the true historical series. Upon varying D^, which 
is the crucial parameter in the simulation, we search for the most realistic behavior of 
the empirical volatility autocorrelation function in the range from few to about hundred 
days. Remarkably enough, for De < 1/2 the obtained autocorrelation functions behave as 
decaying power laws in this range, and the exponent becomes very close to the empirical 
one (/3 ~ 0.2) for — 0.24, (see Fig. 2b and [4]). At the same time, we can try to 
optimize by requesting a realistic agreement between the multiscaling features of the 
simulated pT and those of the empirical one. It is remarkable that the optimal according 
to this second criterion is very close again to 0.24 [4]. This is a further indication that the 
model is coherent and catches the essential statistical features of a long index history. 

A very important test for the proposed model concerns the scaling of the empirical px 
itself. A crucial limitation of simulation methods like ARCH and GARCH is that they 
generate histories for which it is not guaranteed that the sampled pt satisfies scaling. This 
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is in fact regarded as a major open problem of these approaches [7]. In Fig. 5 we report 
the scaling collapse of pt as obtained by one of the histories generated by our simulation. 
The collapse is clearly of the same quality as that reported in Fig 1 ; moreover, the scaling 
function and the exponent emerging from the collapse are very consistent with those valid 
for the historical data. 




Figure 5. Scaling collapse of the pr sampled from a single simulated history. 

The general agreement between the stylized features of the simulated histories and those 
of the the true history [4], suggest that our model based on inhomogeneous scaling and mar- 
ket efficiency catches the robust features of the stochastic component of index evolution. 

5. Conclusions 

The results reviewed in this report show the importance of scaling in building up a model 
of stochastic index evolution in finance. In the construction of the model this symmetry 
enters as a very crucial tool, in the sense that, combined with other constraints, it leads to fix 
very plausible and consistent rules of probabilistic evolution. In view of the success of the 
model, the skepticism on the practical relevance of scaling mentioned in the introduction, 
should be attenuated. A crucial factor which helps in converting scaling into a powerful 
predictive tool here, is that we regard it in a perspective which has roots in the RG approach 
to criticality. This allows even to establish novel paradigms, like the one represented by 
the inhomogeneous scaling in Eq. (18). More generally, the strategy followed here tries to 
profit of the lessons learned from decades of work in complex systems. As clearly stated 
in Ref. [23], the issue of financial market modeling should be addressed by first trying to 
focus on the universal phenomenological features, and on the basic symmetries, rather then 
privileging analytical tractability. 
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We believe that the approach presented here could have more general applicability. 
There are many natural phenomena characterized by anomalous scaling, for which part 
of the features of the model discussed here, or of the arguments leading to it, could reveal 
worth considering. These problems belong in general to the fields of multidiscipUnary 
applications of statistical mechanics. 
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